Pot-luck challenge: Fact Sheet for the PROMO Dataset

ثبت نشده
چکیده

The PROMO dataset proposes the task to identify which promotions affect sales. Artificial data about 1000 promotion variables and 100 product sales is provided. The goal is to predict a 1000 × 100 boolean influence matrix, indicating for each (i, j) element whether the ith promotion has a causal influence of the sales of the jth product. Data is provided as time series, with a daily value for each variable for three years (i.e., 1095 days). Each of the 100 products has a defined seasonal baseline, repeating over the years. The seasonal effect can vary from almost inexistent to major. On top of this baseline are promotions. Each product is influenced by between 1 and 50 promotions out of the 1000 promotions available. Promotions usually increase the sales with respect to the baseline, but can occasionally reduce them (e.g., when a similar competing product is promoted, that promotion might have a negative effect on the sales of the current product). On top of that are daily variations. Each of the 1000 promotions can be seasonal or not; i.e., they can have the same pattern from one year to another or be completely different. The average time a promotion stays active or inactive, however, is constant for each promotion. The weighted normalized influence matrix is provided for result evaluation. It is normalized so that the maximum positive contribution is 1 and the maximum negative contribution is −1, and each nonzero (i, j) entry is weighted by how much promotion i affects product j. Note that, as this matrix is provided, the participants are trusted to use it for evaluation purposes only, and not to tune potential hyperparameter of their approaches.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Pot-luck challenge: FACT SHEET

We designed four datasets for the purpose of benchmarking local causal discovery algorithms. These include two “re-simulated” datasets obtained from artificially generated data from models trained with real data and two datasets including real variables intermixed with artificial variables (called probes). There is no time dependency in the samples. We chose applications in marketing, pharmacol...

متن کامل

AlvsPK Challenge: FACT SHEET

Table 1: Our best results (used only last 5 complete entries) Dataset Entry ID Entry Name Track Test BER Test AUC ADA 1026 vn5 Agnos 0.1751 0.8331 ADA 1024 vn3 Prior 0.1788 0.8225 GINA 1023 vn2 Prior 0.0226 0.9777 GINA 1025 vn4 Agnos 0.0503 0.9507 HIVA 1024 vn3 Agnos 0.2904 0.7343 NOVA 1026 vn5 Agnos 0.0471 0.9456 SYLVA 1024 vn3 Prior 0.0071 0.9959 SYLVA 1025 vn4 Agnos 0.0096 0.9933 Overall 102...

متن کامل

Utility of real-time prospective motion correction (PROMO) on 3D T1-weighted imaging in automated brain structure measurements

PROspective MOtion correction (PROMO) can prevent motion artefacts. The aim of this study was to determine whether brain structure measurements of motion-corrected images with PROMO were reliable and equivalent to conventional images without motion artefacts. The following T1-weighted images were obtained in healthy subjects: (A) resting scans with and without PROMO and (B) two types of motion ...

متن کامل

Automatic segmentation of glioma tumors from BraTS 2018 challenge dataset using a 2D U-Net network

Background: Glioma is the most common primary brain tumor, and early detection of tumors is important in the treatment planning for the patient. The precise segmentation of the tumor and intratumoral areas on the MRI by a radiologist is the first step in the diagnosis, which, in addition to the consuming time, can also receive different diagnoses from different physicians. The aim of this study...

متن کامل

The Use of Robust Factor Analysis of Compositional Geochemical Data for the Recognition of the Target Area in Khusf 1:100000 Sheet, South Khorasan, Iran

The closed nature of geochemical data has been proven in many studies. Compositional data have special properties that mean that standard statistical methods cannot be used to analyse them. These data imply a particular geometry called Aitchison geometry in the simplex space. For analysis, the dataset must first be opened by the various transformations provided. One of the most popular of the a...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2009